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Field of the Invention 
This invention relates to computer networks and, specifically, to an improved technique 
for incrementally updating a one's complement checksum in a network data packet. 

Background of the Invention 
5 Internet use has increased dramatically over the past five years. Both the number of users 

connecting to the Internet and applications requiring greater bandwidth have created the need to 
increase network bandwidth. Consequently, network designers are migrating to higher speed 
backbones, such as Gigabit Ethernet, to compensate for increased network traffic. Higher speed 
backbones place greater demands on network switching performance because network switches 
10 must complete data packet processing more efficiently if data packets are to be switched at 
higher rates. 

In a packet switched network, network traffic consists of data packets that move from one 
network node to another. A node is any computer, printer, router, switch, gateway, or other 
device attached to the network. A network switch directs a data packet from one node to the 

15 next by reading information contained in the data packet header. Frequently, the network switch 
modifies one or more fields of the data packet header to complete this routing. This is especially 
true if the data packet crosses network domain boundaries or if the network switch performs 
operations such as load balancing or packet prioritization. 

For a network switch to route a data packet properly, the structure of the data packet must 

20 conform to the protocol of the network. A computer that places information on the Internet 
generally complies with the Open System Interconnection (OSI) model where the protocol is 
divided into seven layers. Information that is sent from one computer to the next is normally 
generated in the Application layer (layer 7) of the source node by the user or software. The 
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information is then sent to the Presentation layer (layer 6) where it is translated into a format that 
can be read by the destination node. Once translated, the information is then queued in the 
Session layer (layer 5) for managed control of the transmission. The Transport layer (layer 4), 
which receives the information from the Session layer, ensures that the data packet is 
5 successfully transferred between the two end nodes. The Network layer (layer 3) provides 
routing information for the data packet. The Data Link layer (layer 2), which includes the Media 
Access Control (MAC) sub-layer, monitors activity on the network before sending a data packet, 
recognizes boundaries between data packets, and regulates the flow of data packets between 
nodes. The Physical layer (layer 1) comprises the hardware that generates the actual signals that 
10 are transmitted through the physical medium of the connection, such as the wires of an Ethernet 
connection. 

To accomplish their functions, the Transport, Network, and Data Link layers encapsulate 
data with header information, which originates in the Application layer before the data is placed 
on the network. Each layer appends its information to the front of the information passed to it 

15 from the prior layer, thereby encapsulating the information from the prior layer. The result is a 
data packet that contains data and layers of encapsulated header information. 

Because a data packet contains layers of encapsulated information, the type of switching 
performed by a network switch depends upon the layer of information processed. A layer 2 
switch forwards a data packet based upon the physical address of the destination node found in 

20 header information supplied by the MAC sub-layer of the Data Link layer. Because layer 2 
switching usually occurs in hardware, it operates very quickly and is an efficient means of 
handling network transmissions within local area networks (LANs). 
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A network switch configured for layer 3 processing (usually referred to as a router) 
processes information added by the Network Layer. In an Internet Protocol (IP) based network, 
the layer 3 switch uses information from the IP header to calculate routes based upon logical 
addresses, such as IP addresses, rather than physical addresses. 
5 Layer 4 network switches process information supplied by the Transport Layer. In the 

Internet environment, the Transport Layer includes Transmission Control Protocol (TCP) 
segments and/or Universal Data Protocol (UDP) datagrams. A layer 4 network switch uses 
information, such as port numbers, protocol bits, and IP source and destination addresses, to 
move network traffic across the Internet. Load balancing and prioritization of certain data 
10 traffic, such as Voice over IP, are two examples of the types of functions that require layer 4 
processing. 

A network switch configured for layer 4 processing oftentimes modifies one or more 
fields of the IP, TCP, or UDP header to complete routing. For example, a data packet that 
crosses from one Internet domain to the next may undergo Network Address Translation (NAT). 

15 NAT allows IP addresses to be reused across Internet domains, thus reducing the problem of IP 
address depletion. To translate IP addresses, a network switch must modify an IP source or 
destination address of the IP header before forwarding the data packet to its next hop. Because 
the checksum value contained in the BP header is calculated over packet fields including the IP 
source and destination fields, the header checksum must also be modified. 

20 However, re-calculating a header checksum while maintaining switching performance 

requires substantial computational resources, especially if the network data packet contains a 
TCP segment or UDP datagram because the TCP or UDP checksum is calculated over both its 
header and data (payload) fields and the payload may be thousands of bytes long. To re-compute 
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a TCP or UDP checksum, a network switch can buffer the entire TCP segment or UDP datagram, 
modify the header information, re-calculate the checksum, and forward the data packet before 
the next data packet is received. Although a buffering approach is useful in some situations, it 
can require lots of memory and may slow the performance of a switch. 



header checksum incrementally over a portion of the TCP or UDP packet rather than by re- 
computing it over the entirety of the packet. Incremental update takes advantage of the additive 
properties of checksum calculation by adding and subtracting values to the original header 
checksum based upon the changes made to the data packet header. These short calculations 

10 eliminate the need to buffer and re-calculate the checksum over the entire packet. The details of 
incrementally updating the EP checksum can be found in the Internet Engineering Task Force 
RFC 1141 entitled "Incremental Updating of the Internet Checksum" and RFC 1624 entitled 
"Computation of the Internet Checksum via Incremental Update." The details of calculating an 
Internet checksum can be found in RFC 1071 entitled "Computing the Internet Checksum." All 

1 5 three titles are incorporated by reference herein. 

Although RFC 1141 and 1624 provide an improved technique for computing TCP/UDP 
checksums, these specifications do not fully realize the performance improvement available 
using incremental checksum computations. Accordingly, there is a need for an improved 
approach for incrementally calculating TCP and UDP checksums. 

20 Summary of the Invention 

It is an advantage of the present invention to provide a unique method for incrementally 
updating a header checksum in a network data packet. It is another advantage of the present 
invention to provide a unique method for updating a header checksum without having to 
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One way to increase network switch performance is to configure the switch to update the 
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recalculate it over the entire payload. Eliminating the need to recalculate the checksum over the 
payload reduces network switch latency and increases network switch performance. 

In accordance with an embodiment of the invention, a packet processor divides an IP 
packet encapsulating a TCP segment or UDP datagram into two parts: a reduced header part and 
5 a payload-plus part. The reduced header part contains the header fields normally modified 
during layer 4 processing. The reduced header part may include, for example, the IP source and 
destination addresses, protocol bits, and, when applicable, the IPv6 next header field of the IP 
packet header. It may also include the source and destination ports and the sequence and 
acknowledgement numbers of the TCP or UDP header. The payload-plus part normally contains 
10 the data or payload portion of the TCP segment or UDP datagram and any TP header and TCP or 
UDP header fields not specifically included in the reduced header part. These divisions are 
exemplary, and other divisions between the two parts are within the scope and spirit of the 
invention. 

Once the packet processor receives a data packet, the processor performs an intermediate 
15 calculation on the TCP or UDP checksum by subtracting from it a value equal to the reduced 
header part. Subtracting a value from a TCP or UDP checksum generally means adding the 
one's-complement sum of the value to the TCP or UDP checksum. The result is a checksum 
value for the payload-plus portion of the IP packet. This value is deemed, for purposes of the 
invention, the partial one's complement sum. 
20 Next, the processor modifies one or more fields of the reduced header part during layer 3 

or layer 4 processing and calculates a checksum value on the modified reduced header part. It 
then adds this checksum value to the partial one's complement sum to produce the incrementally 
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updated checksum. The incrementally updated checksum then replaces the original checksum in 
the TCP segment or UDP datagram header before the data packet is forwarded to its next hop. 

Other embodiments, methods, features and advantages of the invention will be or will 
become apparent to one with skill in the art upon examination of the following figures and 
5 detailed description. It is intended that all such additional embodiments, methods, features and 
advantages be included within the scope of the invention and be protected by the accompanying 
claims. 

Bri ef Description of the Drawings 

The components in the figures are not necessarily to scale, emphasis instead being placed 
10 upon illustrating the principles of the invention. In the figures, like reference numerals designate 
corresponding parts throughout the different views. 

FIG. 1 is a flow chart illustrating a method of incrementally updating a checksum field in 
a network data packet header in accordance with an embodiment of the invention. 

FIG. 2 is a diagram illustrating the encapsulation of a TCP segment as an IP datagram. 
15 The TCP segment appears in 32-bit wide format. 

FIG. 3 is a diagram illustrating the encapsulation of a UDP segment as an IP datagram. 
The UDP datagram appears in 32-bit wide format. 

FIG. 4 is a diagram illustrating an IPv4 header as it would appear in a 32-bit wide format. 
FIG. 5 is a diagram illustrating an IPv6 header as it would appear in a 32-bit wide format. 
20 FIG. 6 is a functional block diagram of a unit of a network switch in accordance with 

another embodiment of the invention. 
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Detailed Description 
Figure 1 illustrates a method 10 of incrementally updating a checksum value in a network 
data packet. 

A checksum is an error-checking value that allows quick verification of the integrity of a 
received data packet, or parts thereof, and in a TCP/IP or UDP/IP environment, is the sixteen-bit, 
one's-complement of the one's-complement sum of all of the octets over which the checksum is 
computed. A received data packet contains a checksum field having a checksum value 
calculated by the source node. The receiving node recalculates the checksum over the same 
fields and compares the recalculated sum with the value in the checksum field. If the two values 
are the same, it is assumed that the data packet was transmitted error free. 

The method steps comprise receiving a data packet containing a header checksum field at 
a network switch (step 20), computing a partial one's complement sum from the original header 
checksum (step 30), modifying one or more header fields of the data packet header (step 40), 
updating the header checksum field with a new header checksum value that reflects the changes 
made to the header fields (step 50), and transmitting the network data packet with the modified 
header to its next hop (step 60). Although the method itself can be applied generally to any 
network and protocol that allows arithmetic manipulation of its error correction scheme, the 
description below uses, by way of example only, an IP data packet containing a TCP segment or 
UDP datagram. 

Figure 2 shows the headers encapsulation of a TCP segment as an IP datagram. The 
source address 110, destination address 120, protocol bits 140, and other fields are part of the IP 
packet header 102. The IP packet header 102 precedes the TCP segment 104 in the IP packet 
100. The TCP segment 104 consists of a TCP header 106 and the data or payload 108. The TCP 
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header 106 includes a source port 160, a destination port 170, a sequence number 180, an 
acknowledgement number 190, control and offset fields 200, a TCP checksum 210, an urgent 
pointer 220, options 230, and padding 240 fields. 

Referring to Figures 1 and 2, when a packet processor (includable in a network switch) 
5 implementing the method of this invention receives an IP packet 100 (step 20), it divides it into , 
two parts for processing purposes, a reduced header part and a payload-plus part. In the 
preferred configuration, the reduced header part includes the source 110 and destination 120 
addresses, the protocol bits 140, the source 160 and destination ports 170, and the sequence 180 
and acknowledgement 190 numbers. Because the processor receives the EP header 102 and TCP 

10 header 106 before it receives the payload 108, the processor does not need to buffer the entire IP 
packet 100 before it begins processing the header fields. More importantly, because the method 
and system of this invention update the TCP checksum 210 without recalculating it over any 
portion of the payload 108, the method and system reduce network switch latency by limiting the 
number of mathematical computations to simple arithmetic computations over the header fields. 

15 Once the processor locates the header fields that make up the reduced header part, the 

processor performs an intermediate arithmetic calculation on the TCP checksum 210 (step 30). 
The calculation consists of subtracting a value equal to the reduced header part from the TCP 
checksum 210. Because the TCP checksum 210 is the ones-complement (mathematical 
operation) of the ones-complement sum (summation using ones-complement numbering system) 

20 of all of the values over which the checksum is calculated, the subtraction is accomplished by 
adding the ones-complement (mathematic operation) of the ones-complement sum (summation 
using ones-complement numbering system) of the reduced header part to the ones-complement 
of the TCP checksum 210. The result is the ones-complement sum of the payload-plus portion 
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of the IP packet 100 and, for purposes of this invention, is called the partial one's complement 
sum. It is created without having to re-compute it from the data or payload 108 portion of the IP 
packet 100. 

The Table 1 below illustrates this computation. 

TABLE 1 
Nomenclature 

HC - Original TCP Checksum 2 1 0 value in IP packet 1 00 

C - Original checksum value, the ones-complement sum of all 

fields in IP packet 100 over which checksum is computed 
PC - Partial Ones-complement sum 

HC - Updated TCP Checksum 210 replacing original TCP 

Checksum 2 1 0 in IP packet 1 00 
C - Updated checksum value of IP packet 100 containing 

modified header 

m - Ones-complement sum of the unmodified reduced header part 
m' - Ones-complement sum of the modified reduced header part 

Intermediate Calculations 
HC =~C [Eqn. 1] 

PC =C-m [Eqn. 2] 

= C + (-m) 

= C + ~m 

= - HC + ~ m [Eqn. 3] 

Incremental checksum Calculation Algorithm 

C =C + (-m) + m' ~ [Eqn. 4] 

C =PC + m [Eqn. 5] using Eqn. 
HC =~C 

= ~ (C + (-m) + m') Using Eqn. 4 

= ~ ((PC + m) + (-m) + m*) Using Eqn. 5 

= ~ (PC + m') [Eqn. 6] 



By convention, checksum C is the ones-complement sum of all of the fields of the IP 
packet 100 over which the checksum is computed and is equal to the ones-complement sum of 
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the reduced header part plus the ones-complement sum of the payload-plus part. Also by 
convention, the TCP Checksum 210 (HC) is the ones-complement of the checksum C This is 
shown in Equation 1 above. As shown in Equation 2, the partial one's complement sum (PC) is 
by definition the value obtained by subtracting the ones-complement sum of the reduced header 
5 part (m) from the checksum C. Because the checksum C is the ones-complement of the TCP 
Checksum 210 (HC), this subtraction is accomplished by adding the ones-complement of the 
ones-complement sum of the reduced header part to the ones-complement of the TCP Checksum 
210 (HC), as shown by derived Equation 3. The result is the ones-complement sum of the 
payload-plus portion of the IP packet 100. 

10 Referring now to Figure 6, during layer 3 or layer 4 processing, a packet processor 600 

receiving a network data packet may modify any of the IP 102 or TCP header 106 fields before 
forwarding the IP packet 100 to its next hop. As data packets arrive, they are written to the 
buffering FIFO 630. At the same time, the packets are also sent to the In Parser 610. The In 
Parser 610 extracts the IP 102 and/or TCP header 106 fields that make up the reduced header 

15 part. The In Parser 610 then calculates the partial one's complement sum using Equation 3 of 
Table 1 to operate on the extracted fields and the original TCP checksum 210. The results of the 
In Parser 610 are then written into the In Parser RAM 620 to be used by the Transmit Engine 
640. As data packets arrive from the buffering FIFO 630, the Transmit Engine 640 may modify 
any fields within the network data packet (step 40), including any fields within the reduced 

20 header part. It also reads the data from the In Parser RAM 620 and forwards the partial one's 

complement sum and header fields from the reduced header part to the Post Processor RAM. The 
Transmit Engine 640 can be a microprocessor, a programmable device, a dedicated integrated 
circuit, or a part thereof, or any other device capable of being programmed or hardwired to 
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perform data manipulation functions. The Transmit Engine 640 then forwards the modified data 
packet 100 to the Out Parser 660 for further processing. 

Similar in functionality to the In Parser 610, the Out Parser 660 extracts information 
about the header fields. However, it does not compute the partial one's complement sum from 
5 the modified data packet. In addition, it forwards the partial one's complement sum stored in the 
Post Processor RAM 650 to the Post Processor 670. The Post Processor 670 then computes a 
new TCP checksum 210 (HC) by adding the ones-complement sum of the modified reduced 
header part to the partial one's complement sum and then finally by taking the ones-complement 
of the sum, as illustrated by Equation 6 of Table 1. Once the new TCP checksum 210 (HC) is 

10 calculated, the Post Processor 670 replaces the original TCP checksum 210 (HC) in the IP packet 
100 with the new TCP checksum 210 (HC) (step 50) and forwards the IP packet 100 to its next 
hop (step 60). Like the Transmit Engine 640, the Out Parser 660 and the Post Processor 670 can 
be a microprocessor, a programmable device, a dedicated integrated circuit, or a part thereof, or 
any other device capable of being programmed or hardwired to perform data manipulation 

15 functions. 

Figure 3 shows the headers of an IP packet 300 containing a UDP datagram 304 that are 
typically modified during data packet processing. The difference between the processing of a 
UDP datagram 304 versus a TCP segment 104 is that a UDP header 306 contains only a source 
port 360, a destination port 370, and a UDP checksum 380. The UDP header 306 does not 
20 contain a sequence number 180 and an acknowledgement number 190. Thus, the only change in 
the method described above is in the fields that comprise the reduced header part. The method 
10 can be used to process an IP packet 300 containing a UDP datagram 304 where the reduced 
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header part preferably comprises the following fields: the source 310 and destination 320 
addresses, the protocol bits 340, and the source 360 and destination ports 370. 

Figure 4 illustrates an IPv4 header 400. The IPv4 header 400 includes, among other 
fields, protocol bits 410, an IPv4 header checksum 420, an IP source address 430, and an IP 
destination address 440. The IPv4 header 400 normally prefixes the TCP segment 104 of Figure 
2 or the UDP datagram 304 of Figure 3 in a network data packet when a sending node utilizes 
version 4 of the Internet Protocol. The method 10 treats data packets containing version 4 
headers no differently than the data packets 100, 300. However, the IPv4 header checksum 420 
is a checksum of the IPv4 header fields only, whereas the TCP checksum 210 and the UDP 
checksum 380 cover certain fields in the IP header 102, 302, TCP and UDP header 106, 306 
fields, and their payloads 108, 308. When processing a data packet containing an IPv4 header 
400 where one or more fields of the IPv4 header 400 are to be modified, the reduced header part 
of the preferred embodiment comprises the source 430 and destination 440 addresses and the 
protocol bits 410. 

Figure 5 illustrates an IPv6 header 500. Like the IPv4 header 400 of Figure 4 and the IP 
header 102 of Figure 2, it contains protocol bits 520, a source address 540, and a destination 
address 550. Unlike the IPv4 header 400 and the IP header 102, however, it contains a next 
header 510 field. The method disclosed herein treats data packets containing version 6 headers 
no differently than the IP data packets 100, 300, 400 described above, except that the reduced 
header part preferably includes the next header 510 field in addition to the protocol bits 520, the 
source address 540, and the destination address 550. 

While various embodiments of the invention have been described, it will be apparent to 
those of ordinary skill in the art that many more embodiments and implementations are possible 
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that are within the scope of this invention. For example, the method described above is not 
limited to the protocols specifically mentioned herein. It can apply to other protocols that allow 
arithmetic manipulation of their error correction schemes. 
What is claimed is: 
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